The dataset consists of 303 observations and 14 variables in total. We have :
7 categorical variables: sex, cp(chest pain type), fbs(fasting blood sugar exceeds 120 mg/dl or not ), restecg(resting electrocardiographic results), exang(presence of exercise-induced angina), slope(the slope of the peak exercise ST segment), thal(thallium stress test).
7 numerical variables: age, trestbps(resting blood pressure), chol(serum cholesterol levels), thalach(maximum heart rate), oldpeak(distance between baseline of ST segment), ca(number of colored major blood vessels), num(diagnosis of heart disease)
The purpose of our analysis is to uncover key patterns and risk factors associated with coronary heart diseases. We aim to identify factors that highly correlates with heart disease and promote early disease intervention in patients.
We identified several key predictors of heart disease, including the type of chest pain (cp), number of detectable vessels (thal), major vessels colored by fluoroscopy (ca), and ST depression (oldpeak).
In clinical practice, the diagnosis of heart disease often involves individual differences and a multitude of complex factors. The correlated factors can provide important reference points for clinicians in the diagnostic process, helping them to identify potential risk factors for heart disease at an earlier stage, thereby improving patients’ survival chances and overall health.
In summary, our study highlights the potential value of data analysis in enhancing the diagnosis and treatment of heart disease.
Janosi,Andras, Steinbrunn,William, Pfisterer,Matthias, and Detrano,Robert. (1988). Heart Disease. UCI Machine Learning Repository. https://doi.org/10.24432/C52P4X.
Detrano, Robert C. et al. “International application of a new probability algorithm for the diagnosis of coronary artery disease.” The American journal of cardiology 64 5 (1989): 304-10.
Detrano R, Salcedo EE, Hobbs RE, Yiannikas J. Cardiac cinefluoroscopy as an inexpensive aid in the diagnosis of coronary artery disease. Am J Cardiol. 1986 May 1;57(13):1041-6. doi: 10.1016/0002-9149(86)90671-5. PMID: 3706156.
Ayatollahi, Haleh et al. “Predicting coronary artery disease: a comparison between two data mining algorithms.” BMC Public Health 19 (2019): n. pag.
Longo D, Fauci A, Kasper D, Hauser S. Harrison’s principles of internal medicine. 18th ed. New York: McGraw-Hill Professionals; 2011.